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1.  Introduction  and  Statement  of  Main  Results 


^  We  present  some  mathematical  analysis  for  a  class  of  curve  fitting 
algorithms  labeled  "projection  pursuit"  algorithms, by  iFt^dman  and  Stuetzle 
These  algorithms  approximate  a  general  function  of  p  variables 
by  a  sum  of  non-linear  functions  of  linear  combinations^ 


(1.1) 


f(x  ,  ...,x  )  -  l  g  (a  x  +  .. .  4- a  x  )  A 
1  P  i  ii  i  ip 


i-1 


In  (1.1),  f  is  a  given  function  and  univariate,  non-linear  functions 
and  linear  combinations  a.  ...  +  a,  x  are  sought  so  that  a  reasonable 

QtU  lp  ” 

approximation  is  attained.  ^JPch  approximation  is  computationally  feasible 
and  performs  well  in  examples  of  nonparametric  regression  with  noisy  data, 
high  dimensional  density  estimation,  and  multidimensional  spline  approximation. 
In  addition  to  the  articles  of  Friedman  and  Stuetzle  cited  above  see  Friedman 
and  Tukey  (1974),  Friedman,  Grosse  and  Stuetzle  (1981)  for  examples  and 
computational  details.  Huber  (1981  a,  b)  begins  to  connect  the  algorithms  to 
statistical  theory.  ^This  note  treats  the  algorithms  from  the  point  of  view 

of  approximation  theory. 

It  is  easy  to  show  that  approximation  is  always  possible.  _ _ 

i#  ± 

THEOREM  1.  Functions  of  the  form  Ec^e-  **  ,  with  a±  real,  a  a  vector  of 

nonnegative  integers,  and  x  “  (x^ . x^)  are  dense  in  the  continuous  real 

valued  functions  on  [0,  l]p  under  the  maximum  deviation  norm. 


Proof.  The  functions  e~*£  separate  points  of  [0,  1]P  and  are  closed  under 
multiplication.  Finite  linear  combinations  of  such  functions  form  a  point 
separating  algebra  which  is  dense  because  of  the  Stone-Weirstrass  theorem.  ■ 


THEOREM  2.  Functions  of  the  form 

cos  (2  it  a*  •  x)  +  8^  sin  (2  tt  b*  •  x) 

are  dense  in  L^[0»  1]**  . 

Proof*  Any  function  in  L^[0,  l]p  can  be  well  approximated  by  its  Fourier 
expansion.  See  Volume  2  of  Zygmund  (1959)  and  the  survey  article  by  Ash 
(1976)  for  further  details  and  refinements.  ■ 

Sometimes  equality  is  possible  in  (1.1).  For  example 
xy  -  £(x+y)2  -  ^(x-y)2 
max(x,  y)  -  |x+  y  I'  +  -|  |x  -  y  | 

21  47  41  4  2^  14 

(xy)  *  "7  (x+  y)  + - o(x-y) - r  (x+  2y) - ,(x+-£y)  . 

4*3  2  •  3J  3 


In  what  follows  we  will  focus  on  conditions  for  equality  in  (1.1)  as  a  method 
of  determining  examples  to  test,  compare,  and  evaluate  algorithms.  Consider 
first  a  smooth  function  of  2  variables  of  the  special  form 


Clearly 


f  (x,  y)  -  g(ax+  by)  . 


=  0  . 


If  f  has  the  form 

(1.2) 


f(x»  y)  -  l  g1(a1*-*-b  y) 
i-1  1 


2 


then  the  differential  operator 


n 

n 

l-i 


■*1  a  ia  i 
dx  Oy 


i*  n-i 


applied  to  f  is  identically  zero.  The  next  theorem  gives  a  converse. 

THEOREM  3.  Let  f  €  Cn[0t  1]  .  Suppose  that  for  some  real  numbers  cn>  ...t  c  , 

0  n 

n 

the  operator  £  c.  — : — — r  applied  to  f  is  identically  zero.  If  the  polyno- 
n  ii-0  1  9x  3yn_1 

mial  l  c  z  has  distinct  real  zeros  then  (1.2)  holds  for  some  (a  ,  b  )  .  The 
i-0  11 

lines  a^x  +  b^y  are  all  distinct. 

Theorem  3  is  proved  in  Section  2  which  also  contains  a  discussion  of 
techniques  for  finding  directions  (a^ b  )  given  f  .  Some  applications  of 
Theorem  3  are  contained  in  the  following  examples. 

APPLICATION  1.  The  functions  e and  sin  xy  cannot  be  written  in  the  form 

(1.1)  for  any  finite  n.  Indeed,  the  equation  £c.  — - - r  {e  y]  =0  implies 

1  3xidyn"i 

c±  S  0  and  the  associated  polynomial  has  complex  roots. 

APPLICATION  2.  Let  f(x,y)  be  a  polynomial  of  degree  m  .  Then 

m 

f(x»  y)  ■  l  g.(a  x+b  y) 

1-1  1  1  1 

where  each  g^  is  a  polynomial  of  degree  at  most  m .  This  follows  by  elimi¬ 
nating  manipulations  from  Theorem  3*  Thus,  any  polynomial  in  2  variables 

2 

can  be  represented  exactly.  Since  polynomials  are  dense  in  C[0,  1]  >  this 

gives  another  proof  of  denseness  of  projection  pursuit  approximations.  A  dif¬ 
ferent  proof  of  this  result  is  in  Logan  and  Shepp  (1975)  •  An  extension  to  more 
than  two  variables  is  in  Proposition  1  of  Section  2. 


3 


APPLICATION  3.  Representations  of  the  form  (1.1)  are  not  necessarily  unique. 
For  example 

2  2 
xy  ■  c(ax  +  by)  -  c(ax-by) 

2  2 

for  any  a  and  b  satisfying  ab  0,  a  +  b  -1  with  c  *  l/4ab  . 

Writing  a  ■  cos  0t  b  ■  sin  0  ,  any  non-coordinate  direction  can  be  chosen 

for  the  quadratic  .  The  second  direction  is  forced  as  orthogonal  to  this. 

This  suggests  that  substantive  interpretation  of  the  linear  combinations 

(a^,  *S  difficult.  For  a  more  ambitious  example,  consider  the  function 
2 

(xy)  .  This  is  of  4th  degree.  Use  of  Theorem  3  as  outlined  in  Section  2, 

2 

shows  that  (xy)  cannot  be  expressed  as  a  sum  of  n  »  3  or  fewer  terms  in 
(1.1).  Four  terms  of  4th  degree  suffice: 

(xy)2  -  a^x  +  bjy)4  +  c*2(x  +  b2y)4  +  a.j(x  +  b3y)4  +  a^x  +  b^y)4  , 

where  b^,  b^>  b^>  b^  are  chosen  as  distinct,  and  satisfying 

blb2  +  blb3  +  blb4  +  b2b3  +  b2b4  +  b3b4  “  0  * 

Then  a^9  ctj*  are  determined  by 


where  the  sum  and  product  are  over  j  +  i  .  This  clearly  defines  a  three 
dimensional  family  of  solutions. 

In  thinking  about  non-uniqueness,  we  observed  that  the  only  examples  of 
non-unique  representation  we  could  find  are  polynomials.  Indeed,  polynomials 
have  the  following  strong  non-uniqueness  property. 


4 


DEFINITION.  A  function  f(x,  y)  has  strongly  non-unique  representations  if 
there  are  two  sets  of  directions  { ^ai#  ®i^i»l*  a11  dlstinct 
from  each  other,  such  that 

n  m 

f(x,  y)  »  l  g  (a  x+b  y)  -  £  h  (a  x+ g  y) 

i-1  1  i-1  1  1  1 

for  some  and  h^. 

Polynomials  have  strongly  non-unique  representations:  if  deg  P(x,  y)  *  n, 
and  (a^,  are  any  distinct  directions.  Theorem  1  implies  that  P(x,  y)  can 

be  represented  In  these  directions.  It  turns  out  that  only  polynomials  have 

this  property.  This  is  a  consequence  of 

THEOREM  4.  Let  f(x,  y)  e  Cn+m[0,  1]^.  Suppose  that  for  some  directions 

iai»  b1}1=1  and 

n  m 

f(x,  y)  *>  l  g  (a  X+b  y)  =  £  h  (a  x+6  y) 

i=l  1  1  i-1  1  1  1 


If,  for  some  i,  (a^,  8^)  is  distinct  from  (a^b^),  1  <_  j  £  n,  then  h^  is  a 
polynomial  of  degree  at  most  n  +  m  -  2. 

9  3 


Proof. 


ut  * '  jji  (6i  £ '  «j  •  a-d  ■  ■  ^  (,j  -h  -  bi  » > 


Then, 


so 


for 


Af  -  EAg^  is  a  sum  of  functions  in  directions  (a^,  b^) 


0  -  BAF  -  Ch^®*11"13  (o^x  +  g^) 


C  -  {  n  (o  b  -  B-a  )}  •  {  n  (a.g  -g.a.)}  *  0. 
J-l  3  1  J  tfi  1  3  13 


Thus,  h^  is  a  polynomial  of  degree  at  most  m  +  n  -  2.  ■ 


COROLLARY .  A  function  f(x,  y)  has  strongly  non-unique  representations  if 
and  only  if  f  is  a  polynomial. 

Remark.  The  statement  and  proof  of  Theorem  4  carry  over  to  functions  of 
more  than  two  variables  in  a  straightforward  way.  See  Lemma  1  of  Section  2. 

How  are  the  curve  fitting  algorithms  affected  by  non-uniqueness?  To 
understand  this,  we  performed  the  following  experiment.  On  each  trial  200 
independent,  mean  zero,  variance  1,  normal  points  (x^,  y^)  were  generated. 

The  algorithm  of  Friedman  and  Stuetzle  was  given  x^,  y^,  and  x^y^  + 
with  normally  distributed*  mean  zero,  variance  .1  errors.  We  expected 
the  directions  fit  to  change  a  great  deal.  In  each  of  100  trials  the  al¬ 
gorithm  fit  univariate  functions  in  directions  (1,  1)  and  (1,-1)  (to  two 
decimal  places) . 

To  understand  this,  it  is  important  to  consider  the  nature  of  the  al¬ 
gorithm.  At  each  stage,  it  chooses  the  direction  which  minimizes  the  residual 
sum  of  squares  when  the  best  fitting  function  in  that  direction  is  subtracted 
off.  See  Friedman  and  Stuetzle  (1981)  for  a  careful  description.  If  the 
sample  size  is  large,  the  algorithm  will  behave  in  the  same  way  as  the 
infinite  population  analog.  Thus,  let  X,  Y  be  independent  Gaussian  variables 

with  mean  zero  and  variance  1.  Consider  approximating  XY  by  the  best  linear 

2 

combination  of  the  form  aX  +  bY.  For  fixed  a  and  b,  the  L  norm  is  minimized 

2  2 

by  the  function  E(XY|aX+bY).  Which  values  of  a  and  b,  subject  to  a  +  b  *  1, 
minimize 

E{XY  -  E(XY|aX+bY)}2? 

Let  us  show  that  the  minimum  is  achieved  at  a  ■  ±b  -  ±l//2  .  Let  U  *  aX  +  bY, 

V  -  aX  -  bY.  Than  U  and  V  are  Independent  standard  normal  and 
”  “  4ib  *<»!•*  ■►»>  *  4^  {U2-!}-  Then, 


E{XY  -  E(XY|aX+bY)}2  -  - -  e{(U2-V2)  -  (U2-l)}2  -  - ~  E{V2-l}2. 

(4ab)  (4ab) 

Since  the  distribution  of  V  does  not  depend  on  a  and  b,  the  right  side  is 
2  2  1 

minimized  when  a  *  b  *  ^  .  When  the  best  fitting  function  is  subtracted 
off,  the  second  stage  of  the  algorithm  subtracts  off  a  quadratic  in  the  ortho¬ 
gonal  direction  and  the  algorithm  terminates  after  two  steps.  The  same 

2 

result  can  be  shown  to  hold  when  X  and  Y  are  chosen  uniformly  in  [-1,  1]  . 

Similar  computations  can  be  instructively  carried  out  for  functions 

2 

other  than  XY.  For  example,  consider  (XY)  .  David  Donoho  has  shown  that  if 
X  and  Y  are  normally  distributed  the  algorithm  chooses  the  four  directions 
(1,  1),  (1,-1),  (1,  0),  (0,1).  Moreover,  Donoho  can  prove  that  the  approx¬ 
imation  does  not  terminate  after  four  steps,  even  though  the  function  can 
be  expressed  as  a  sum  of  four  4th  degree  polynomials.  Infinitely  many  steps 
are  required  -  successive  terms  being  cyclically  added  in  each  of  the  four 
directions . 

Donoho  and  Ian  Johnstone  have  independently  proved  that  for  normally 
distributed  X  and  Y,  the  greedy  approximation,  which  at  each  stage  finds 
the  a  and  b  to  minimize 

E{fn(X,  Y)  -  E{fn(X,Y)|aX+bY}}2, 

^  t2 

converges  in  L  . 

These  results  underscore  a  property  of  the  projection  pursuit  algorithm: 

2 

the  directions  it  chooses  are  the  directions  that  minimize  the  L  error.  The 
situation  is  somewhat  like  finding  the  principal  components  of  a  covariance 
matrix.  There  are  many  possible  bases,  but  the  directions  chosen  have  a 
well  defined  interpretation  in  terms  of  maximum  reduction  of  variance. 


7 


APPLICATION  4.  Even  if  the  directions  (a^,  b^)  are  fixed,  the  representation 
need  not  be  unique*  Suppose  that  n  is  the  smallest  integer  such  that 


If  also 


then 


n 

f(x.  y)  *  l  g,  u.x+b.y)  . 
i-1  1  1  1 


n 

f(x,  y)  ■  l  h  (a  x+  b.y)  , 
i-1 


f^(t)  -  hi(t)  -  p^t)  ,  1  <.  i  <,  n 


with  a  polynomial  of  degree  at  most  n  -  1  .  The  polynomials  p^  can  be 

chosen  in  an  arbitrary  way  subject  to  the  constraint  Zp^  =  0  .  In  particular, 
any  n  -  1  of  the  p^  can  be  chosen  arbitrarily  and  a  final  polynomial  can  be 
found  to  satisfy  the  constraint*  These  results  all  follow  easily  from  Theorem  3 


indeed  the  operator  L 


n 


,  js_  JL 

[  j  3x  aj  3yJ 


applied  to  f(x,  y)  gives 


hin"1)(aix+biy) 


g(n_1)  (a  x+ b.y)  II  (b  a  -  a  b  ) 
1  J  1  J  1 


The  products  are  non-vanishing  because  the  directions  are  distinct.  It  follows 
that  h^  differs  from  by  at  most  a  polynomial  of  degree  n  -  1  ,  and  that  an 
arbitrary  polynomial  may  be  added  subject  to  the  constraint. 

In  the  special  case  n  *  2  ,  Theorem  3  was  given  by  Dotson  (1968)  who 
suggests  further  application  to  factoring  probability  densities  and  separation 
of  variables. 

The  generalization  to  dimension  greater  than  two  is  not  as  neat.  We  give 

a  result  for  3-dimensions  which  generalize  to  p-dimensions.  Suppose  that  for 

13  3 

n  distinct  directions  a  €  »  ,  a  function  f  can  be  represented,  for  x  e  It  ,  as 

n  . 

(1.3)  f(x)  -  l  g.  (a1  •  x)  . 

i-1  1 


8 


Let  n1  -  {P€  IR^  :  p  •  a* -  0  }.  Let  V  "  ( ).  Clearly, 


i  1  i 

n  (p  •  V)f  =  0  for  all  p  €  n  . 

i-1 


This  condition  is  not  sufficient.  To  see  this,  consider  the  function 

1  2 

f(xr  x2*  *3)  *  xlx2  and  tlie  three  directions  a  -  (1,  0,0),  a  *  (0,  1,  0) 

3  i  i  i 

and  a  -  (0,  0,  1).  For  any  non-zero  p  €  II  ,  the  operator  II (p  •  V)  applied 

to  f  is  zero.  Yet,  f  cannot  be  written  as  f(x)  *  f^(x^)  +  f^x^)  +  ^3^X3^ ’ 

4 

Here  is  a  second  example.  Any  operator  of  the  form  II  (p  •  V) ,  with  p  non- 

i-1  1  i 

zero,  applied  to  the  function  6x^x^x^  yields  zero.  Despite  this,  7  directions 
are  needed: 

,  3.3,3,.  ,  *  >.3  ,  .  .3  ,  ,  .3  ,  ,  v3 

6x1X2X3  53  x!  +  x2  +  x3  +  ^X1  +  X2  +  X3  "  ^Xl+X2'  "  <xl+x3>  "  ^X2+X3  . 

The  condition  is  sufficient  "up  to  polynomials": 

THEOREM  5.  Let  ar  be  distinct,  non-zero,  directions  in  IR^.  Let  Ilr  be  the 

3  r  n  3 

plane  {peIR  :a  •  p=0}.  A  function  f  e  C  (R)  has  the  form 

n 

f (x)  -  l  g  (ar  •  x)  +  P(x) 
r-l  r 


for  a  polynomial  P  of  degree  less  than  n,  if  and  only  if 


(1.4)  II  (p  •  V)f  *  0  for  all  pr  e  Hr . 

i=l 

Remark.  As  noted  above,  condition  (1.4)  is  not  sufficient  to  ensure  that 
representation  (1.3).  If  (1.3)  holds,  there  are  other  obvious  necessary 
conditions:  If  e  IT*  0  ,  then 

(p1^  •  V)  gi(ai  •  x)  «  (p^  •  V)  gjCa1  •  x)  -  0  . 

Thus  f  is  annihilated  by  differential  operators  of  degree  , • • • » 


n-1. 


Unfortunately,  even  these  conditions  are  not  sufficient.  H.  Royden  in  un¬ 
published  work,  has  determined  necessary  and  sufficient  conditions  of  a 
rather  different  type.  In  Proposition  1  of  Section  2  we  show  that  any  poly¬ 
nomial  P  can  be  written  as  a  sum  of  univariate  polynomials  of  linear  combi- 
nations.  If  deg  P  =  k  ,  then  (.  2  )  terms  may  be  required. 

Thus  far  we  have  been  assuming  sufficient  differentiability.  Versions 
of  all  theorems  are  valid  if  derivatives  are  interpreted  in  the  sense  of 
distributions .  This  is  discussed  in  some  detail  in  Section  3. 

Our  theorems  are  related  to  Hilbert’s  13th  problem.  In  modern  notation 
Hilbert  asked  if  there  are  genuine  multivariate  functions.  Of  course,  x  +  y 

l02  X  'f  l02  V 

is  a  function  of  2  variables  but  xy  =  e  e  is  a  superposition  of 

univariate  functions  and  +.  Kolmogorov  and  Arnold  showed  that,  in  this  sense, 

+  is  the  only  function  of  2  variables.  They  constructed  3  monotone  functions 

<t>^  :  [0,  1]  -►IT,  which  satisfy  j  4>^(x)  -  <J>(y)  j  <.  jx-y|.  These  functions  have 

2 

the  following  remarkable  property:  for  each  f  e  C[0,  1]  there  is  a 
g  €  C[0,  1]  such  that  for  all  (x,  y) 

n  1 

f(x,  y)  =  l  g(<p.(x)  +  J  4>.(y)) . 
i=l 

Thus  are  a  "universal  change  of  variables"  which  allows  exact  equality. 

A  nice  discussion  of  this  result  and  its  refinements  can  be  found  in  Lorentz 
(1966,  1980)  and  Vertushkin  (1977).  While  the  functions  and  g  are  given 
in  a  constructive  fashion,  it  does  not  seem  that  this  result  is  used  to 
approximate  functions  in  an  applied  context.  This  is  probably  because  the 
functions  are  fairly  "wild".  For  example,  it  is  known  that  it  is  not 
possible  to  choose  to  be  C*  functions,  so  fixed  linear  combinations  of 


10 


n 

x  and  y  are  ruled  out.  It  is  known  that  f(x,  y)  *  £  ^a-t x  +  )  f°r  aH 

i-1  1 

polynomials  f(x,y)  is  not  possible  with  a^  fixed  independent  of  f.  In 

the  probjection  pursuit  approach  to  approximation,  a^  and  b^  are  allowed  to 

depend  on  f  and  Example  2  shows  that  now  any  polynomial  can  be  written  in 

required  form.  Example  1  shows  that  not  all  functions  can  be  so  expressed. 

This  paper  has  characterized  functions  that  can  be  represented  exactly 

as  a  sum  of  non-linear  functions  of  linear  combinations.  It  is  important 

to  be  able  to  recognize  functions  that  can  be  well  approximated  by  such  a 

sum.  Some  important  work  on  this  problem  is  in  the  papers  by  Logan  and 

Shepp  (1975)  and  Logan  (1975).  These  papers  work  with  prespecified  directions, 

but  the  main  results  of  Logan  (1975)  do  not  depend  on  the  directions.  Roughly, 

2 

Logan  shows  that  a  function  on  the  unit  disc  can  be  well  approximated,  in  L  , 
by  a  sum  of  n  univariate  functions  if  and  only  if  the  function  has  bandwith 
n,  in  the  sense  that  its  Fourier  transform  is  essentially  supported  on  a 
disc  of  radius  n. 


11 


r 


2.  Proof  and  Discussion  of  Theorems  3  and  5  »  Let  L  be  the  differential 
n 

operator:  7  c.  — z - r  .  By  hypothesis,  the  pc  -nomial 

l  •  l  n-i 
i-0  3x  3y 


x>  i 


r  i  n-i  nr  (  X 1 

I  cxy  -y  l  C  (-J 

i-0  i-0  J 


,  with  the  lines  a^x  +  b^y  distinct.  It  must  be  shown  that 
n 


splits  into  distinct  linear  factors.  Thus  L  can  be  written  as 

nr  s  o  1 

Ilib.  t—  -  a.  T~ 

i  i  i  9y  j 

f  can  be  represented  as  £  g  (a.x  +  b  y)  .  The  proof  is  by  induction  on  n  . 

i-1  1  1 

For  n  -  1,  suppose  without  real  loss  that  a^  +  0  .  Then  f(x,  y)  -  g(a^x+b^y) 
with  g(z)  -  f(-~,0)  .  One  way  to  show  this  is  to  fix  (x,  y)  and  define 


h(t) 


-*1 

bi  bi  1 

x  +  —  y  -  —  yt,  ty 

.  Then  h(0)  -  f 

r  bi  i 

|  x  +  y,  0 

! 

ai  ‘i  J 

l  a  J 

gUjX  +  bjjr) ; 


h(l)  -  f  (x,  y)  and  h'  (t)  =  0  ,  for  0  <  t  <  1.  The  fundamental  theorem  of 


calculus  gives  h(l) 


l1- 


+  h(0)  •  Suppose  the  result  is  true  for  operators 


of  degree  <.n  -  1  .  To  prove  it  for  degree  n  ,  write 

n-1 


n 

n 

i-l 


.  ±  _  ± 
n  3x  an  3y  J 


f  =  0  . 


By  the  induction  hypothesis,  there  are  functions  g^,  1  <.  i  <.  n-  1  satisfying 

(2.1) 


K  -L  _  a  _Ll 

[  n  3x  n  3y 


n-1 

f  -  l  8i(aix+l> ty) 

i-l 


A  solution  f  of  (2.1)  of  the  form 


n-1 


f*(x,  y)  -  l  h.  (a  x+b  y) 
i-l 


is  found  by  choosing  h.  (t)  -  (b  a.  -  a  b.) 

i  n  l  n  l 


g.  (s)ds  •  This  is  well  defined 

U  X  U  A  X 

f  g  0  3  1  & 

because  the  lines  are  distinct*  Now  i  b  -r - a  r-Wf-f*)  =  0  can  be  solved 

^  a  3x  n  3y  J 
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explicitly  with  (f-f  )(x,  y)  -  h^a^x+b^y)  by  the  argument  for  n  -  1.  It 
follows  that  f  ■  f*  +  hfl  can  be  written  in  the  required  form.  ■ 

Remarks  on  Explicit  Computations.  If  f  is  of  the  form  (1.2)  then  Theorem  3 


gives  the  existence  of  numbers  crt,  ...,c  such  that  c. 
°  O’  n  j 


3x19ytl"1 


(f)  =  0. 


The  c^  can  be  found  by  fixing  n  +  1  distinct  pairs  (x^,  y^)  ,  calculating 

3°  I 

— — r  and  solving  the  resulting  system  of  equations  for  c  . 

3x  3yn  | (xi#  y ^ 

It  is  feasible  to  check  if  the  polynomial  crt  4*  . . .  +  c  zn  has  distinct  real  roots 

0  n 

using  techniques  in  Chapter  6  of  Henrici  (1977).  Each  stage  of  the  procedure  is 

feasible  by  a  finite  algorithm.  If  the  procedure  fails  at  any  stage,  then 

equality  is  impossible.  Given  feasible  cn,  c  ,  it  may  be  possible  to  find 

0  n 

the  roots  of  the  associated  polynomial.  This  determines  directions  (a^,  b^)  . 

In  simple  examples  there  is  often  enough  freedom  of  choice  to  make  deter¬ 
mination  of  (a^,  b^)  possible.  Consider  f(x,  y)  *  xy  for  n  *  2  , 


n 

i»l 


3f  3f 
bi  3x  ai  3y 


*  blb2  y  '  (bla2+b2al)  W  +  V2  ^2 


2  2  2 
c<  9  f  .if..  3  f 

"  3x2  3y2  ' 


1  ;  any  distinct  choice  of  a^  and  b^  with 


bj^  *  ”^2ai  wor^3.  Taking  =  b^  *  1,  a^  *  -b^  =  1  ,  we  are  led  to  solve 


f(x,  y)  «  g^x+y)  +  g2(x-y) 


Applying  -r - t—  to  both  sides  leads  to  y  -  x  -  2g  (x-  y)  ;  setting  y  ■  0  ; 

dx  dy  9  Z 


2  2 
x  n  f  f 


g2(x)  ■  -  J,  g2  -  +  c2  .  Similarly,  g^(x)  -  —  +  and  the  result  is 


12  12 

xy  -  -j  (x  f  y)  +  c^  -  *jr(x-y)  +  c^  where  +  c2  “  0  is  forced.  In  general, 

n  r.  3  3  1  .  (n-1),  ..  ,  f 

bi  37  '  ai  37  f  "  ci*i  (aix+  biy)  »  for  an 


if  f  -  l  g  (a  x  +  b.y)  ;  II 
1-1  1  1  1  j*i 

explicit  .  This  determines  g^  up  to  an  essentially  free  choice  of  an 


n  -  1  degree  polynomial. 
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r 


r  '  I 


In  the  case  of  a  polynomial  f»  some  additional  tricks  become  available. 

a  b  n  n 

For  a  multinomial  x  y  let  a  +  b  •  n;  only  sums  of  the  form  £  a.  (x  +  6  y) 

i»l  1  3 

need  be  considered.  Expanding  out  and  equating  coefficients  gives 

la.  -  0,  Ea.B,  -  0  ...  Ea,B®  -  — —  ...  Ea,8"  -  0. 

1  11  l)  (J)  1J 

This  gives  n  +  1  equations  in  2n  unknowns.  These  are  linear  in  the  a's  for 
given  8’s  and  may  be  solved  explicitly  because  the  matrix  is  a  Vandermonde 
with  a  well  known  inverse.  See  Gautschi  (1963). 


The  proof  of  Theorem  3  was  outlined  by  H.  Roydan.  The  proof  follows 

r  3 

from  three  lemmas.  Throughout  a  are  distinct  non-zero  directions  in  1R  . 

I 

LEMMA  1.  If  £  g  (ar  •  x)  5  0  then  ^  =  0  and  g^  is  a  polynomial  of 
r*l  r 

degree  at  most  4-2. 


Proof .  Fix  r.  For  each  j  £  r  there  is  €  II ^  but  •  ar  #  0.  Apply 
the  operator  II  (p^*V)  to  the  sum  to  conclude  II  (p^  *a^)g^  ^  (p^  •  x)  =  0. 
The  coefficient  is  non- zero,  so  the  conclusion  follows.  ■ 


In  the  next  two  lemmas,  the  notation  f .  means  -5 —  . 

3  Bxj 

LEMMA  2.  Let  P  and  Q  be  polynomials  of  degree  £k  in  (x^,  x^x^).  Suppose 

3 

that  in  some  open  0  <=  1R 


P 


3 


-  Q 


2  ' 


Then  there  is  polynomial  of  degree  at  most  k  +  1  such  that 

P  •  H2  and  Q  -  H3  . 
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Proof.  Argue  in  a  cube  (a<.x^<.a,  c<_x^<.y}  contained  in  0.  Let 

F  be  a  path  connecting  (x^,  a,b)  to  (x^f  x^»  x^)  which  lies  entirely  in  the 


plane  of  constant  x^.  The  line  integral 


* 

P(X  ,  y , 

Jr 


z)dy  +  Q(x,  y,  z)dz  ■  HCx^  x2>  xj 


is  independent  of  T  in  view  of  the  hypothesis  and  Green's  theorem.  Further¬ 
more, 


dH  =  Pdx2  +  Qdx3 


3H  .  3H 

r —  =  P  and  - —  =  Q 
dx2  dX^ 


In  particular,  let  T  be  the  path 


t  •*  (xL>  a  +  t(x2  -  a),  b)  ,  0  <  t  <  1 

t  (xx,  x2 ,  b  +  (t-l)(x--b)),  1  <  t  £  2 


H(x^ ,  x2>  x3>  =  (x2  -  a)  P(x3  ,  a  +  t  (x2  -  a)  ,  b)dt 


(x3  -  b)  P(x^,  x2,  (t  -  1) (x3~  b))dt 


with  is  clearly  a  polynomial  of  degree  £k  +  1. 


n+2 

LEMMA  3,  Let  f  e  C  (IR  )  have  the  following  properties;  for  n  distinct 
directions  ar,  with  ar  distinct  from  (1,  0,0), 


(2.2a) 

f2Cx>  -  I 

gr(ar  •  x) 

+  P(x) 

r-1 

n 

(2.2b) 

f3w  -  I 

hr(ar  •  x) 

+  Q(x)  . 

With  P  and  Q  polynomials  of  degree  at  most  n  -  1;  then  there  are  univariate 
functions  G^,  1  <,  r  <,  n+1,  and  a  polynomial  Q  of  degree  at  most  n  such 
that 

n 

f(x)  »  l  Gf(ar  •  x)  +  G^Cx^  +  Q(x)  • 

r*l 

Proof.  Because  conditions  (a)  and  (b)  translate  into 

n  1  1 

o  -  l  (a^gr  -  a^)  (ar  •  x)  +  P3(x)  -  Q2(x)  . 


By  hypothesis,  for  i  i  r  there  are  vectors  p *  e  II*  with  pi  *  ar  j  0*  Let 

A  *  IT  (p**V).  Applying  A  to  P  (x)  -  Q  (x)  gives  zero  because  this  poly- 
i^r  3  2 

nomial  is  of  degree  at  most  n  -  2.  Thus,  Lemma  1  implies 


where 


It  follows  that 


0  i  c| -  .&»>) 

c  ■  II  (p*  •  ar)  4  0 . 
i*r 


(2.3) 


a^gr(ar  •  x)  -a*  (ar  •  x)  -  Pr(ar  • x) 


for  P  a  polynomial  of  degree  at  most  n  -  1. 
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i 


Because  the  ar  are  distinct  from  (1,  0,0),  either  1*  0  or  ^  0.  Define 

the  n  functions  G  by 
r 

(2.4a)  =  gf/ if  a*  ¥  0 

(2.4b)  Gr  =  tWalj  if  a*  =  0. 

Consider 

n 

4>  (x)  =  f(x)  -  J  G  (ar  •  x)  . 

r=l  r 

From  the  hypothesis,  (2.3)  and  (2.4), 

«j>2(x)  =  f  2  (x)  -  Ea2G\ar.x)  =  £  gr(ar  •  x)  +  P(x)  =  P*(x) 

a2=0 

with  P  a  polynomial  of  degree  at  most  n  -  1.  Further, 

<f>3(x)  =  f3(x)  -  Ea3G'(ar  •  x)  =  5:{hr(ar  •  x)  -  aljc'  (a**  •  x) }  +  Q  . 

Each  term  in  the  sum  is  a  polynomial.  T  f  a^  4  0  the  rth  term  equals 

hr (a  •  x)  -  a3gr (a  •  x) /a? 

a  polynomial  from  (2.3).  If  *  0,  the  rth  term  is  zero.  It  follows  that 

Q3  =  Q*(x) 

with  Q*  a  polynomial  of  degree  at  most  n  -  1.  To  finish  off,  observe  that 
^23  *  ^32  gives  P3  *  Q* *  From  Lemma  2,  there  is  a  polynomial  H  of  degree 
at  most  n,  such  that  H2  »  P*  and  H3  *  Q*.  The  function  ty  -  $  -  H  has 
^2  ”  ^3  *  0*  Thus,  ip  is  only  a  function  of  x^,  as  required. 
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Proof  of  Theorem  5*  Clearly,  if  f  can  be  represented  as  a  sum  of  n  uni¬ 
variate  functions  plus  a  polynomial  the  differential  operator  kills  f. 

The  proof  of  the  converse  is  by  induction  on  n.  For  n  =*  1,  we  know  that 
if  fj  ■  f j  ■  0  then  f  is  a  function  of  x^  only.  Rotating  to  bring  the  plane 
n1  into  {P  :p2*P^*0}  proves  the  general  case.  Suppose  that  the  result 
is  true  for  n  -  1.  Let  a\  an  ^,an  be  n  distinct  non-zero  directions. 

By  rotating,  we  may  assume  that  an  *  (10  0).  Then,  for  any  p*  e  31*, 

1  <  i  <  n-i, 

JKP1  •  V)f2  =  IKp1  •  V)f3  =  0  . 

The  induction  hypothesis  yields  that  f  satisfies  conditions  (2.2)  of 
Lemma  3.  The  theorem  follows.  ■ 

PROPOSITION  1.  Let  p  and  k  be  positive  integers.  Let  r  * 

There  are  r  distinct  directions  a\  ,  ...»  ar  in  IR^  such  that  any  homo¬ 
geneous  polynomial  f  of  degree  m  can  be  written  as 


m+  p  -  1 
m 


f(x) 


l 

j-1  J 


x) 


m 


for 


some  real  numbers  a  . 

r 


Proof.  The  space  of  homogeneous  polynomials  of  degree  m  is  an  r  dimensional 
vector  space  over  the  real  numbers.  Let  m^(x),  1  <  i  <  r  be  an  enumeration 
of  the  monomials.  For  each  monomial,  let  D  be  the  associated  differential 

2  V 

operator  (e.g.,  if  ra  (x)  -  x^x0x«,  D.  -  — x - ).  Observe  that 

1  1  1  J  1  3xpx23x3 

D^(ar  •  x)m  ■  m!m^(ar).  For  dimension  reasons,  to  prove  tha  proposition  it 
suffices  to  show  that  directions  can  be  chosen  so  that  the  polynomials 
(a^  •  x)m,  J  ■  1,  *..,r,  are  linearly  independent.  Suppose 

(a^  •  x)m  -  0  . 
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Applying  Dj  we  get 


r  , 

£  m  (aJ)c.  ■  0  for  all  i  -  1, 

j-1  3 

For  this  system  to  have  a  non-trivial  solution  (c^,  Cf)  we  must  have 

(*)  det(mi(a3))  *  0 

We  write  a3  -  (a3,  ....a3).  Since  detCm^a3))  is  a  non-trivial  algebraic 
expression  with  rational  coefficients,  if  we  choose  the  pr  real  numbers  a3 
in  such  a  way  that  they  are  algebraically  independent  over  Q  ,  then 

detCm^a3))  j*  0. 

contradicting  (*) .  ■ 
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3.  Some  Generalizations 

The  theory  presented  so  far  does  not  apply  to  identities  between  non- 
dif ferentiable  functions.  Most  of  the  results  remain  valid  if  differentia¬ 
tion  is  interpreted  in  the  sense  of  distributions.  Consider  the  identity 

max  (x,  y)  «  (|  x+  yj  +  |  x-  y| ) . 

For  fixed  y  the  function  max  (x,  y)  is  constant  for  x  <.  y  and  equal  to  x 
for  larger  x.  Thus 

3  ,  ,  j° x< y 

—  max  (x,  y)  *  < 

1  x  >  y 


This  function  is  also  called  the  Heavyside  function  shifted  to  y.  Its 


derivative  is  well  known  to  be  the  delta  function  concentrated  on  the  line 


oo  2  . 

x  *  y.  This  acts  on  4>  e  Cn  (IR.  )  by  6(cf>)  =*  / 

2  U 
a 

— —  max  (x,  y)  =  <5 ,  so  max  (x,  y)  is  a  solution 

ay 


<^(tf  t)dt.  Similarly, 
of  the  wave  equation 


U - 2  U  *  0  . 

dx  3y 


The  only  solutions  of  this  equation  are  of  the  form  U  *  f^(x  +  y)  +  f^fx-y) 
where  f^  and  f^  are  distributions  (see  Schwartz  (1966),  p.  8  for  some  history). 
We  further  show,  here  and  more  generally,  that  if  the  solution  U  is  a  suffi¬ 
ciently  well  behaved  function,  then  the  f^  are  functions. 

Any  undefined  terms  in  the  following  discussion  can  be  found  in  Barros- 
Neto  (1973)  or  Schwartz  (1966).  Let  P(1R  )  be  the  space  of  test  functions  - 

oo  |  2 

compactly  supported  C  functions.  The  dual  space  V  (IK  )  Is  the  space  of 
distributions  on  1R2.  For  y»  (a,b),  the  translate  of  T  c  V'  by  y  is  written 
T  .  This  acta  on  <{>  e  V  by  T^{4>(x)]  *  T{<J>(x-Y)}.  The  distribution  T  e  P'(IR2) 
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depends  only  on  ax  +  by  if  for  all  real  t ,  T ,  *  T.  The  following 
- —  (bt,-at)  - 

theorem  collects  together  several  results  in  Section  11.5  of  Schwartz  (1966). 
It  is  the  case  in  *  1  of  the  theorem  we  are  aiming  at . 

i  2 

THEOREM  6.  Let  T  e  V  (IR  )  .  For  (a,b)  non-zero,  the  following  conditions 
are  equivalent 

(a)  T  depends  only  on  ax  +  by. 

(b)  (bl;-ai>T-o. 

(c)  There  is  a  distribution  g  e  V  (]R)  such  that  for  all  <t>  £  0(IR")  f 

T(4>)  3  g(  ^(an-fbv,  bu-av)dvl 
where  g  operates  on  the  function  of  u  inside  the  brackets. 


2  2 

Remark.  If  T  and  g  are  functions  with  T(x,  y)  =  g(ax+by)  and  a  +b  *  1 , 
then  part  (c)  becomes 


T(x,  y)<Kxf  y)dxdy 


g(ax  +by)4>(x,  y)dxdy 


g(u)0(au+  bv,  bu-av)dudv  *  g{  ]  <£(au  +  bv,  bu-av)dv). 
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»  2 

THEOREM  7.  Let  T  e  V  (IR  ).  Suppose  that  for  some  real  numbers  c_,c.,  .  ..,c  , 
m  .m  01m 

r*  3  i. 

the  operator  £  c.  — z - 7  applied  to  T  is  zero*  If  the  polynomial  Zc.z 

i-0  1  3xi3ym'i  1 

has  distinct  real  zeros  then  there  are  distinct  non-zero  Y,  .  Y~ . Y  ; 

Yj  ■  (a^  bj)  such  that,  writing  for  qY  , 


m 


(3.1) 


T  -  l  q^g.)  with  g  e  V*  (IR)  . 
i-1  1 


/■  3  S 

Conversely,  if  (1)  holds,  then  H(b^  —  -  applied  to  f  is  zero. 


Proof .  One  direction  is  clear;  the  argument  for  the  other  direction  is  by 

induction  on  m.  The  case  to  **  1  follows  from  Theorem  6.  Thus  assume  the  result 

2  2 

for  m-1.  Without  loss  of  generality,  assume  a^+b^*l  for  l£i£m.  Then 

3y  ;  ^1  3x  °1  3y 


n2  ( bi  3x  ai  3y  ^  bl  3x  "  al  L  3y  ^  f  0  * 
i»2 


This  implies 
(3.2) 


aii^f  =  A  n(*i)  • 


i*2 


We  will  show  that  (3.2)  has  a  solution  f  of  form 


f*  -  l  q!(h  )  . 

i-2 

T  9  * 

Supposing  this,  (b^  ~  aj^  -g^)(f-f  )  •  0,  so  by  the  result  for  m  =  1, 

jl  £  * 

f_f  .  q  (g^)  for  gj^  €  &  (R)  giving  the  theorem.  To  complete  the  proof, 
let  be  a  distribution  solution  to 


dh. 


dt  bjS^  -  ai*>i  f 


A  solution  exists  by  Theorem  IX,  p.  130  of  Schwartz  (1966).  We  claim 
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r  *  * 

l  q.OO  -  f 
i=*2 

is  a  solution  to  (3.2).  To  show  this,  we  need  the  following  relation: 

aq  >  *  3x  q  (h>  • 

CO  2 

To  prove  this,  consider  $  *  Cq(H  Now 

*  »  «  r 

aq  (h  )($)  =  ah  (  4>(au+bv,  bu  -  av)dv) 


=  -h(  [a  (f>^(au  +  bv,  bu  -  av)  +  b  4)2(au+bv»  bu-av)]dv). 


Since  (J>  is  compactly  supported 


^  (<{>(au  +  bv,  bu-av))dv  =  0. 


Thus, 


[b  <t>^(au+bv,  bu  -  av)  -  a  ^(au  +  bv,  bu  -  av)dv  *  0. 


Using  this 


*  ' 


aq  (h  )  C4>)  *  -h( 


[a  4>^(au  +  bv ,  bu  -  av)  +  bZ  (^(au+bv,  bu-av)]dv), 


-  -h( 


4^(au  +  bv,  bu-av)dv 


-k  q*(hm) 


This  relation  implies 


(bi  ’k  “  ai  &  qi(V  * 


The  claim  regarding  f  follows. 


■  i 

23 


i 


/  3  3  \  *  *  * 

Thus,  1  ^  *  ai  =  *  The  claim  regarding  f  follows,  * 

The  next  theorem  shows  that  if  the  equation 

n  * 

f(x,y)  *  l  q.U.,) 

i-1 

holds  in  the  sense  that  the  two  sides  are  equal  as  distributions,  and  if 
f(x,y)  is  a  sufficiently  regular  function,  then  each  of  the  distributions 
g^  can  be  realized  as  a  function  on  1R.  Theorems  of  this  sort  may  be 
described  as  results  on  propogation  of  singularities  of  partial  differen¬ 
tial  equations. 

The  notion  of  sufficiently  regular  we  adopt  involves  the  Sobolev  spaces 

s  00 

H  ;  the  definitions  involve  Fourier  transforms,  and  so  the  space  g  of  C 

functions  that,  together  with  all  derivatives,  tend  to  zero  at  infinity  fas- 

1 

ter  than  any  polynomial.  The  dual  of  g  ,  denoted  g  is  the  space  of  tem- 

2 

pered  distributions.  The  Fourier  transform  of  0  *  g(JR  )  is 


}(X) 


e  ^  x  0(x)dx 


where  dx  is  1/271  times  Lebesgue  measure.  The  Fourier  inversion  theorem 
becomes 


4>(x) 


eiX’x  4>( A)d A  . 


The  Fourier  transform  of  a  tempered  distribution  8  c  g  is  defined  by 


0(4))  -  6(0)  . 


s  2 

For  real  s,-00  <  s  <  00  f  the  Sobolev  space  H  (F  )  is  the  set  of  tempered 
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distributions  0  t  S  (R^)  such  that  (1  4*  joj2  +  |r)|2)s^2  f(o#n)  <  L2(R2). 
There  are  various  embedding  theorems  that  say  when  a  distribution  is  a 
function.  For  example,  Section  3  of  Chapter  1  in  Taylor  (1980)  gives 

a)  If  s  >  n/2  ,  then  each  0  <r  H  (R  )  is  a  bounded  continuous 
function  that  vanishes  at  infinity. 


b)  If  s  >  n/2  4-  k  ,  then  HS(Rn)  c  C^(Hn)  . 

CL 

c)  For  0  <  ci  <  1  ,  define  C  as  the  set  of  bounded  functions  u 
such  that  |u(x+v)-u(y)  j  <  C]y|a  for  |  y  ]  <  1.  If  s  = 

0  <  a  <  1  ,  then  H2(®n)  cCa(Bn). 

d)  Tf  0  '  S  ''  "  .  HS(Kn)  C  Lq(  Rn)  ,  q  = 

—  n  —7  c 


We  have  chosen  the  route  of  interpolating  between  integer  values  of  s  by 
means  of  the  Fourier  transform.  There  are  other  routes.  See  Adams  (1975) 
for  discussion. 

2  s 

For  U  open  in  R  ,  H^^dJ)  is  the  set  of  distributions 

1  oo  s  ? 

O  c  £  (U)  such  that  for  each  compactly  supported  ^  c  Cq(U)  ,  $  •  9  •  H  (R“). 

1  2 

For  example,  max(x,y)  c  H^oc(lR  With  this  notation,  we  can  state  the  main 
result . 


THEOREM  8.  Let  y^  =  (a^,b^) ,  1  <  i  <  m  be  distinct  non-zero  directions 
2 

in  R  .  Let  denote  projection  in  the  direction  y^  ,  so  q^(x,y)  = 

2 

x  +  b^  y.  Let  U  be  open  in  R  .  Let  be  open  sets  in  1R  with 

q^(U^)  U  for  all  i.  Suppose  f  f  Hi0c^  Can  written 


m 


f  ■  l  qi<8t) 

i=l 


where  gi  €  £  (up .  Then  g±  t  H^CU^ 
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The  proof  of  Theorem  8  will  be  given  following  two  preliminary  lemmas. 

2 

Let  (a,b)  be  a  unit  vector  in  H  .  Let  q(x,y)  *  ax  +  by  denote 

*  * 

the  projection.  For  g  t  §  (H)  ,  the  distribution  q  g  acts  on 

*  €  S(I^)  as  g(/  ^(au+bv,  bu-av)dv).  The  distribution  p^g  acts  on 
2 

functions  c  S(R  )  by  g(ip(at ,bt)) .  We  have 
LEMMA  4 .  (q  g)  *  p*(g) . 

Proof.  q*g(^)  *  q*g(^)  *  gif  ^(au+bv,  bu-av)dvl.  Now  the  integral 
equals 


fe-i(au+bv)x  -  i(bu-av)y  d(Xfy)dj|dydv 


f  -i(ux-by)u-  (bx-iy)v  ,  , 

e  J  <Kxfy)dxdydv 


e  isU  ^tV  4>(as+bt,  bs-at)dsttdv 


-isu/ 


e  *tV  4>(as+bt,  bs-at)dtdv)ds 


lv|c 


The  inner  integral  equals  4>(as,bt)  ;  indeed  for  any  function  g  c  C^(K) 
/  e  g(t)dtdv  »  6(g)  *  6(g)  *  g(0).  Making  this  substitution,  proves 
the  result .  Q 

The  next  lemma  is  the  case  m  *  1  of  Theorem  8. 

LEMMA  5.  Let  it  :  1R^  *+  ®.  be  the  projection  Tr(x,y)  *  x,  U1  c  H  and 

-1  f  1 
U  z>  tt  (U^) .  Let  T  e  £  (U)  and  assume  there  is  g  e  £  (U^)  such  that 

T  -  7T  (g)  . 
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f 


Lt  r  ,IocU,  .  then  g  , 

Proof .  Without  loss  of  generality  we  may  assume  U  *  n.  ^(U^)  since 


T  -  rr  (g).  Let 

Vi  c  Vi  c 

U1 

with  Vx  open. 

<( 
►— * 

compact , 

and 

X  < 

C0<U1> 

with  X  1  on 

Vx .  Then 

it 

suffices  to  show 

Xg 

-  HS(F). 

Let 

f  = 

Xg 

and  6  =  tt  (\g).  Then  the  hypothesis  on  T  implies 


Let  4>(x,y)  =  ^(y)  and  ^  >  CqOR)  be  such  that 


(y)  = 


|y  i  i  r 

|y  2  R 


for  some  constants  0  <  r  <  R.  Then  <*>\  <  Cq(U)  S° 

4>0  «.  HS(R2)  . 

/\ 

This  means  4>0  is  a  function  and 


(3.3) 


|(W(p,n)|2(l+  |oi2+  |n|2)s/2dpdn  <  »  . 

i  J 


We  will  argue  that  (3.3)  implies  f  <:  HS(F).  Lemma  4  implies  that  0=p*(f). 
Moreover,  f  is  an  analytic  function  of  one  variable,  being  the  Fourier 
transform  of  a  distribution  of  compact  support  (see  Section  4.5  of  Barros- 
Neto  (1973)).  Thus 


(3.4) 


0(p,n)  ■  f(p)  6Q(n)  • 


Also 
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(3.5) 


<Kp,n)  “  <50(p)  4>x<n)  • 

/N  /v  ^ 

Using  ($0)  *  <t>  *  0  With  (3.4)  and  (3.5), 

(4>e)(p,n>  -  ^(n)  *  f(p)  . 

Thus  (3.3)  becomes 

(3.6)  [[  k,(n)|2  |f(p)t2  d+  |p[2  +  ln|2)8/2  <  •  . 

# 

Now  elementary  arguments  show  that  for  any  real  s  there  are  positive  con¬ 
stants  p^,  such  that 


Proof  of  Theorem  8.  We  may  assume  are  unit  vectors.  The  case  m»l  follows 
from  Lemma  5  a  linear  transformation.  For  the  general  case,  suppose 
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Let  D,  *  II.,.  (b,  -  a.  r~) .  Then,  for  a  non-zero  constant  C,  , 

i  jr  i  j  >x  ]  >y  i 


Dif  *  Diq*(8i)  “  ciqi(8im  l>)  • 


s-m+1 . 


Since  f  r  H°qc(U)  .  Dtf  1  Hloc(U)‘  By  L®”®3  5* 


(m-1)  s-m+1,  \ 

8i  ‘  Hloc  'V  * 


This  implies  that  t  (U^)  *  see  ^or  examP^-e  Theorem  7.6  in  Treves 

(1966).  Q 
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